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Abstract 

We consider systems of word equations and their solution sets. We discuss 
some fascinating properties of those, namely the size of a maximal indepen¬ 
dent set of word equations, and proper chains of solution sets of those. We 
recall the basic results, extend some known results and formulate several 
fundamental problems of the topic. 
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1 Introduction 

Theory of word equations is a fundamental part of combinatorics on words. It is a 
challenging topic of its own which has a number of connections and applications, 
e.g., in pattern unification and group representations. There have also been several 
fundamental achievements in the theory over the last few decades. 

Decidability of the existence of a solution of a given word equation is one 
fundamental result due to Makanin [16j. This is in contrast to the same problem 
on Diophantine equations, which is undecidable m Although the complexity 
of the above satisfiability problem for word equations is not known, a nontrivial 
upper bound has been proved: it is in PSPACE [I9j. 

Another fundamental property of word equations is the so-called Ehrenfeucht 
compactness property. It guarantees that any system of word equations is equiv¬ 
alent to some of its finite subsystems. The proofs (see [Ij and 0) are based on a 
transformation of word equations into Diophantine equations and then an appli¬ 
cation of Hilbert’s basis theorem. Although we have this finiteness property, we 
do not know any upper bound, if it exists, for the size of an equivalent subsystem 
in terms of the number of unknowns. And this holds even in the case of three 
unknown systems of equations. In free monoids an equivalent formulation of the 
compactness property is that each independent system of word equations is finite, 
independent meaning that the system is not equivalent to any of its proper sub¬ 
systems. We analyze in this paper the size of the maximal independent systems 
of word equations. 
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As a related problem we define the notion of decreasing chains of word equa¬ 
tions. Intuitively, this asks how long chains of word equations exist such that 
the set of solutions always properly diminishes when a new element of the chain 
is taken into the system. Or more intuitively, how many proper constraints we 
can define such that each constraint reduces the set of words satisfying these con¬ 
straints. It is essentially the above compactness property which guarantees that 
these chains are finite. 

Another fundamental property of word equations is the result of Hmelevskii 
m stating that for each word equation with three unknowns its solution set 
is finitely parameterizable. This result is not directly related to our considera¬ 
tions, but its intriguity gives, we believe, a strong explanation and support to 
our view that our open problems, even the simplest looking ones, are not trivial. 
Hmelevskii’s argumentation is simplified in the extended abstract pH], and used 
in |2Qj to show that the satisfiability problem for three unknown equations is in 
NP. A full version of these two conference articles has been submitted [IF] . 

The goal of this note is to analyze the above maximal independent systems 
of equations and maximal decreasing chains of word equations, as well as search 
for their relations. An essential part is to propose open problems on this area. 
The most fundamental problem asks whether the maximal independent system of 
word equations with n unknowns is bounded by some function of n. Amazingly, 
the same problem is open for three unknown equations, although we do not know 
larger than three equation systems in this case. 

2 Systems and Chains of Word Equations 

The topics of this paper are independent systems and chains of equations in semi¬ 
groups. We are mostly interested in free monoids; in this case the equations 
are constant-free word equations. We present some questions about the sizes of 
such systems and chains, state existing results, give some new ones, and list open 
problems. 

Let S' be a semigroup and H be an alphabet of variables. We consider equations 
U = V, where U, V € H + . A morphism h : —> S is a solution of this equation 

if h(U) = h(V). (If S is a monoid, we can use H* instead of H + .) 

A system of equations is independent if it is not equivalent to any of its proper 
subsystems. In other words, equations Ei form an independent system of equations 
if for every i there is a morphism hi which is not a solution of E t but which is 
a solution of all the other equations. This definition works for both finite and 
infinite systems of equations. 

We define decreasing chains of equations. A finite sequence of equations 
Ei,.... E m is a decreasing chain if for every i e {0, ...,m — 1} the system 
E\,..., E t is inequivalent to the system E\,... ,Ei + \. An infinite sequence of 
equations E\ , E 2 ,... is a decreasing chain if for every i > 0 the system E\ ,..., Ei 
is inequivalent to the system E \,..., Ei + \. 

Similarly we define increasing chains of equations. A sequence of equations 
Ei, ..., E m is an increasing chain if for every i € {1,..., m} the system Ei, ..., E m 
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is inequivalent to the system E l+ i,..., E m . An infinite sequence of equations 
E\. E' 2 t ■ ■ ■ is an increasing chain if for every i > 1 the system Ei, Ei+i,... is 
inequivalent to the system E l+ i, Ei +2 ,.... 

Now Ei,, E m is an increasing chain if and only if E m ,..., E\ is a decreasing 
chain. However, for infinite chains these concepts are essentially different. Note 
that a chain can be both decreasing and increasing, for example, if the equations 
form an independent system. 

We will consider the maximal sizes of independent systems of equations and 
chains of equations. If the number of unknowns is n, then the maximal size of an 
independent system is denoted by IS(n). We use two special symbols ub and oo for 
the infinite cases: if there are infinite independent systems, then IS(n) = oo, and if 
there are only finite but unboundedly large independent systems, then IS(n) = ub. 
We extend the order relation of numbers to these symbols: k < ub < oo for every 
integer k. Similarly the maximal size of a decreasing chain is denoted by DC(n), 
and the maximal size of an increasing chain by IC(n). 

Often we are interested in the finiteness of DC(n), or its asymptotic behaviour 
when n grows. However, if we are interested in the exact value of DC(n), then 
some technical remarks about the definition are in order. First, the case i = 0 
means that there is a solution which is not a solution of the first equation E \; that 
is, E\ cannot be a trivial equation like U = U. If this condition was removed, 
then we could always add a trivial equation in the beginning, and DC(n) would 
be increased by one. Second, we could add the requirement that there must be 
a solution which is a solution of all the equations E\,... ,E m , and the definition 
would remain the same in the case of free monoids. However, if we consider free 
semigroups, then this addition would change the definition, because then E m could 
not be an equation with no solutions, like xx = x in free semigroups. This would 
decrease DC(n) by one. 

3 Relations Between Systems and Chains 

Independent systems of equations are a well-known topic (see, e.g., 0)- Chains of 
equations have been studied less, so we prove here some elementary results about 
them. The following theorem states the most basic relations between IS, DC and 

IC. 

Theorem 3.1. For every n, IS(n) < DC(n),IC(n). //DC(n) < ub orIC(n) < ub, 
then DC(n) = IC(n). 

Proof. Every independent system of equations is also a decreasing and increasing 
chain of equations, regardless of the order of the equations. This means that 
IS(n) < DC(n),IC(n). 

A finite sequence of equations is a decreasing chain if and only if the reverse 
of this sequence is an increasing chain. Thus DC(n) = IC(n), if DC(n) < ub or 
IC(n) < ub. □ 

A semigroup has the compactness property if every system of equations has 
an equivalent finite subsystem. Many results on the compactness property are 
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collected in [8j. In terms of chains, the compactness property turns out to be 
equivalent to the property that every decreasing chain is finite. 

Theorem 3.2. A semigroup has the compactness property if and only i/DC(n) < 
ub for every n. 

Proof. Assume first that the compactness property holds. Let E\. E 2 , ■ ■ ■ be an 
infinite decreasing chain of equations. As a system of equations, it is equivalent 
to some finite subsystem Ej t ,..., Ei k , where i\ < ■ ■ ■ < i^. But now E\.... Ei k is 
equivalent to E \,..., Ei k+ \. This is a contradiction. 

Assume then that DC(n) < ub. Let E\, E^, ■ ■ ■ be an infinite system of equa¬ 
tions. If there is an index N such that E\,, Ej is equivalent to E \,..., Ei+\ for 
all i > N, then the whole system is equivalent to E\, ..., Epf. If there is no such 
index, then let i\ < *2 < ... be all indexes such that E\.... Ei k is not equivalent 
to Si,..., Ei k+ 1 - But then E^, Ei 2 , ... is an infinite decreasing chain, which is a 
contradiction. □ 

The next example shows that the values of IS, DC and IC can differ signifi¬ 
cantly. 

Example 3.3. We give an example of a monoid where IS(1) = 1, DC(1) = ub 
and IC(1) = 00 . The monoid is 

(cq,U2, • • ■ | a^aj — ajai, a^ — a ^}. 

Now every equation on one unknown is of the form x l = x 3 . If i < j, then this is 
equivalent to x l = x* +1 . So all nontrivial equations are, up to equivalence, 

o o o 

ry* - I ryt ^ - nr* ry* - ry* ^ 

•h - -L 1 U/ - iL/ 1 ih - a/ j • . . ^ 

and these have strictly increasing solution sets. Thus IC(1) = 00 , DC(1) = ub 
and IS(1) = 1. 

4 Free Monoids 

From now on we will consider free monoids and semigroups. The bounds related 
to free monoids are denoted by IS, DC and IC, and the bounds related to free 
semigroups, by IS+, DC + and IC + . 

We give some definitions related to word equations and make some easy ob¬ 
servations about the relations between maximal sizes of independent systems and 
chains, assuming these are finite. 

A solution h is periodic if there exists a t € S such that every h(x), where 
x G S, is a power of t. Otherwise h is nonperiodic. An equation U = V is balanced 
if every variable occurs as many times in U as in V. 

The maximal size of an independent system in a free monoid having a non¬ 
periodic solution is denoted by IS^n). The maximal size of a decreasing chain 
having a nonperiodic solution is denoted by DC^n). Similar notation can be used 
for free semigroups. 
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Every independent system of equations E\,..., E m is also a chain of equations, 
regardless of the order of the equations. If the system has a nonperiodic solution, 
then we can add an equation that forces the variables to commute. If the equations 
in the system are also balanced, then we can add equations X{ = 1 for all variables 
x \,..., x n , and thus get a chain of length m + n + 1. If they are not balanced, 
then we can add at least one of these equations. 

In all cases we obtain the inequalities IS'(n) < IS(n) < IS , (?r) + l and DC , (n) + 
2 < DC(n) < DC ; (n) + n + 1, as well as IS(n) + 1 < DC(n) and IS'(n) < DC^n). 
In the case of free semigroups we derive similar inequalities. Thus IS / and DC ; 
are basically the same as IS and DC, if we are only interested in their finiteness 
or asymptotic growth. 

It was conjectured by Ehrenfeucht in a language theoretic setting that the 
compactness property holds for free monoids. This conjecture was reformulated in 
terms of equations in |2], and it was proved independently by Albert and Lawrence 
p] and by Guba [6j. 

Theorem 4.1. DC(n) < ub, and hence also IS(n) < ub. 

The proofs are based on Hilbert’s basis theorem. The compactness property 
means that DC(n) < ub for every n. No better upper bounds are known, when n > 
2. Even the seemingly simple question about the size of IS^S) is still completely 
open; the only thing that is known is that 2 < IS^S) < ub. The lower bound is 
given by the example xyz = zyx, xyyz = zyyx. 


5 Three and Four Unknowns 


The cases of three and four variables have been studied in [3], The article gives 
examples showing that IS' + (3) > 2, DC + (3) > 6, 18^(4) > 3 and DC + (4) > 9. We 
are able to give better bounds for DC + (3) and DC(4). 

First we assume that there are three unknowns x, y , z. There are trivial 
examples of independent systems of three equations, for example, x 2 = y,y 2 = 
z,z 2 = x, so IS + (3) > 3. There are also easy examples of independent pairs 
of equations having a nonperiodic solution, like xyz = zyx, xyyz = zyyx, so 
18^(3) > 2. Amazingly, no other bounds are known for IS + (3), IS^S), IS(3) or 
IS'(3). 

The following chain of equations shows that DC(3) > 7: 


xyz = zxy, 
xyxzyz = zxzyxy, 
xz = zx, 
xy = yx, 
x = 1, 

y = i, 

z = 1, 


x = a, y = b, z = abab 
x = a, y = b, z = ab 

x = a, y = b, z = 1 

x = a, y = a, z = a 

x = 1, y = b, z = a 

x = 1, y = 1, z = a 

x = 1 , y = 1 , z = 1. 
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Here the second column gives a solution which is not a solution of the equation on 
the next row but is a solution of all the preceding equations. Also DC + (3) > 7, 
as shown by the chain 


xxyz = zxyx , 
xxyxzyz = zzyxxyx, 
xz = zx, 
xy = yx, 
x = y, 

X = z, 

ry* ry» ry* 

ih it/ it/ 


x = a, y = b, z = aabaaba 
x = a, y = b, 2 = aaba 

x = a, y = b, z = a 

x = a, y = aa, z = a 
x = a, y = a, z = aa 

x = a, y = a, z = a 

no solutions. 


If there are three variables, then every independent pair of equations having 
a nonperiodic solution consists of balanced equations (see m■ it follows that 
IS'(3) + 4 < DC(3). There are also some other results about the structure of 
equations in independent systems on three unknowns (see and 0 ). 

If we add a fourth unknown t, then we can trivially extend any independent 
system by adding the equation t = x. This gives IS+(4) > 4 and IS(j_(4) > 3. For 
chains the improvements are nontrivial. The following chain of equations shows 
that DC(4) > 12: 


xyz = zxy , x 

xyt = txy. x 

xyxzyz = zxzyxy , x 

xyxtyt = txtyxy, x 

xyxztyzt = ztxztyxy , x 

xz = zx , x 

xt = tx, x 

xy = yx , x 

x = 1, x 

y = i, x 

z = 1, x 

t = 1, x 


= a, y = b, z = abab, t = a 
= a, y = b, z = abab, t = abab 

= a, y = b, z = ab, t = abab 

= a, y = b, z = ab, t = ab 

= a, y = b, z = ab, t = 1 

= a, y = b, z = 1, t = ab 

= a, y = b, z = 1, t = 1 

= a, y = a, z = a, t = a 

= 1, y = a, z = a, t = a 

= 1, y = 1, z = a, t = a 

= 1, y = 1 , 2 = 1 , f = a 

= 1, y = 1, 2 = 1, t = 1. 


The next theorem sums up the new bounds given in this section. 

Theorem 5.1. DC+(3) > 7 and DC(4) > 12. 

6 Lower Bounds 

In [I3j it is proved that IS(n) = D(?r 4 ) and IS_|_(n) = 0(n 3 ). The former is 
proved by a construction that uses n = 10m variables and gives a system of m 4 
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equations. Thus IS(n) is asymptotically at least n 4 /10000. We present here a 
slightly modified version of this construction. By ’’reusing” some of the unknowns 
we get a bound that is asymptotically n 4 /1536. 

Theorem 6.1. If n = 4m, then IS^n) > m 2 (m — l)(m — 2)/6. 

Proof. We use unknowns x t . ijj. z. L . tj. where 1 < i < m. The equations in the 
system are 

E(i,j,k,l) : XiXjXkyiyjykZiZjZkti = tiXiXjX k yiyjykZiZjZ k , 

where i,j, k, l £ {1,..., m} and i < j < 
then 

fab, if r £ {i,j, k} 
x r = < 

11, otherwise 
_ J 6a, if r € {i,j,k} 

Zf — \ 

I 1, otherwise 

is not a solution of E(i,j,k,l), but is a solution of all the other equations. Thus 
the system is independent. □ 

The idea behind this construction (both the original and the modified) is that 
{ababa) k = (ab) k a k (ba) k holds for k < 3, but not for k = 3. It was noted in |)T5] 
that if we could find words Ui such that (u\ ... u m ) k = u k ... u^ holds for k < K, 
but not for k = I\, then we could prove that IS(n) = fl(n A+1 ). However, it has 
been proved that such words do not exist for K > 5 (see Cl), and conjectured 
that such words do not exist for K = 4. 

For small values of n it is better to use ideas from the constructions showing 
that DC(3) > 7 and DC(4) > 12. This gives IS'(n) > (n 2 — 5n + 6)/2 and 
DC(n) > (n 2 + 3n — 4)/2. The equations in the system are 


k. If i,j, k,l £ {1,, m} and i < j < k, 


y r = 




\a, if r£{i,j,k} 

I 1, otherwise 

ababa, if r = l 
1, otherwise 


xyxziZjyziZj = ZiZjXZiZjyxy, 

where i , j £ {1,..., n — 2} and i < j. The equations in the chain are 


xyz k = z k xy, 
xyxz k yz k = z k xz k yxy, 

xyxziZjyziZj = ZiZjXZiZjyxy, 

xz k = z k x , 
xy = yx, 
x = 1, 

y = i, 


Zk = 1, 


where i, j £ {1,..., n — 2}, i < j and k £ {1,..., n — 2}. Here we should first take 
the equations on the first row in some order, then the equations on the second row 
in some order, and so on. 
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We conclude this section by mentioning a related question. It is well known 
that any nontrivial equation on n variables forces a defect effect; that is, the values 
of the variables in any solution can be expressed as products of re — 1 words (see 
[7] for a survey on the defect effect). If a system has only periodic solutions, then 
the system can be said to force a maximal defect effect, so IS 7 (re) is the maximal 
size of an independent system not doing that. But how large can an independent 
system be if it forces only the minimal defect effect, that is, the system has a 
solution in which the variables cannot be expressed as products of re — 2 words? 
In |13] it is proved that there are such systems of size 12(re 3 ) in free monoids and 
of size P(re 2 ) in free semigroups. Again, no upper bounds are known. 


7 Concluding Remarks and Open Problems 

To summarize, we list a few fundamental open problems about systems and chains 
of equations in free monoids. 


Question 1: Is IS(3) finite? 

Question 2: Is DC(3) finite? 

Question 3: Is IS (re) finite for every re? 
Question 4: Is DC (re.) finite for every re? 


A few remarks on these questions are in order. First we know that each of 
these values is at most ub. Second, if the answer to any of the questions is ”yes”, 
a natural further question is: What is an upper bound for this value, or more 
sharply, what is the best upper bound, that is, the exact value? For the lower 
bounds the best what is known, according to our knowledge, is the following 


Question 1: IS(3) > 3, 

Question 2: DC(3) > 7, 

Question 3: IS(re) = P(re 4 ), 

Question 4: DC(re) = fl(n 4 ). 

A natural sharpening of Question [3] (and [4j asks whether these values are expo¬ 
nentially bounded. 

A related question to Question [T] is the following amazing open problem from 
[2] (see, e.g., [3j and [1] for an extensive study of it): 


Question 5: Does there exist an independent system of three equations with three 
unknowns having a nonperiodic solution? 


As a summary we make the following remarks. As we see it, Question [3] is a 
really fundamental question on word equations or even on combinatorics on words 
as a whole. Its intriguity is revealed by Question [I] we do not know the answer 


even in the case of three unknowns. This becomes really amazing when we recall 
that still the best known lower bound is only 3! 

To conclude, we have considered equations over word monoids and semigroups. 
All of the questions can be stated in any semigroup, and the results would be dif¬ 
ferent. For example, in commutative monoids the compactness property (Theorem 
ED holds, but in this case the value of the maximal independent system of equa¬ 
tions is ub (see [JE3j). 

Note added on June 9, 2015 

When writing the original version of this article, we were not aware of any previous 
research on increasing chains. However, they were defined and studied already in 
1999 by Honkala [12] (in the case of free monoids). They were called descending 
chains in that article. Decreasing chains were called ascending chains and Theorem 
lT2l was proved in the case of free monoids. Most of the paper was devoted to 
descending chains and test sets. 

The following conjecture, which we state here using our notation, was given in 
[12) : In free monoids IC(n) < ub for all n. This appears to be a very interesting 
and difficult problem. Proofs of Ehrenfeucht’s conjecture are ultimately based on 
the fact that ideals in polynomial rings satisfy the ascending chain condition. As 
pointed out in [12], the same is not true for the descending chain condition, so the 
above conjecture could be expected to be significantly more difficult to prove than 
Ehrenfeucht’s conjecture was. Of course, if DC(n) < ub, then IC(n) = DC(n) by 
Theorem 13.11 
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